Comparison of eigensolvers for symmetric band matrices
نویسندگان
چکیده
We compare different algorithms for computing eigenvalues and eigenvectors of a symmetric band matrix across a wide range of synthetic test problems. Of particular interest is a comparison of state-of-the-art tridiagonalization-based methods as implemented in Lapack or Plasma on the one hand, and the block divide-and-conquer (BD&C) algorithm as well as the block twisted factorization (BTF) method on the other hand. The BD&C algorithm does not require tridiagonalization of the original band matrix at all, and the current version of the BTF method tridiagonalizes the original band matrix only for computing the eigenvalues. Avoiding the tridiagonalization process sidesteps the cost of backtransformation of the eigenvectors. Beyond that, we discovered another disadvantage of the backtransformation process for band matrices: In several scenarios, a lot of gradual underflow is observed in the (optional) accumulation of the transformation matrix and in the (obligatory) backtransformation step. According to the IEEE 754 standard for floating-point arithmetic, this implies many operations with subnormal (denormalized) numbers, which causes severe slowdowns compared to the other algorithms without backtransformation of the eigenvectors. We illustrate that in these cases the performance of existing methods from Lapack and Plasma reaches a competitive level only if subnormal numbers are disabled (and thus the IEEE standard is violated). Overall, our performance studies illustrate that if the problem size is large enough relative to the bandwidth, BD&C tends to achieve the highest performance of all methods if the spectrum to be computed is clustered. For test problems with well separated eigenvalues, the BTF method tends to become the fastest algorithm with growing problem size.
منابع مشابه
A Parallel Bisection and Inverse Iteration Solver for a Subset of Eigenpairs of Symmetric Band Matrices
The tridiagonalization and its back-transformation for computing eigenpairs of real symmetric dense matrices are known to be the bottleneck of the execution time in parallel processing owing to the communication cost and the number of floating-point operations. To overcome this problem, we focus on real symmetric band eigensolvers proposed by Gupta and Murata since their eigensolvers are compos...
متن کاملA performance study of the PLAPACK and ScaLAPACK Eigensolvers
This report compares the performance of two packages available on HPCx, PLAPACK and ScaLAPACK, for the solution of the symmetric, standard eigenvalue problem. Both PLAPACK and ScaLAPACK provide parallel eigensolvers based on the QR algorithm for dense real symmetric matrices. These packages also provide newly developed algorithms: PLAPACK contains an eigensolver based upon the Multiple Relative...
متن کاملPerformance Analysis of Parallel Eigensolvers of Two Libraries on BlueGene/P
Many applications in computational science and engineering require the computation of eigenvalues and vectors of dense symmetric or Hermitian matrices. For example, in DFT (density functional theory) calculations on modern supercomputers 10% to 30% of the eigenvalues and eigenvectors of huge dense matrices have to be calculated. Therefore, performance and parallel scaling of the used eigensolve...
متن کاملOptimizing Locality of Reference in Symmetric Eigensolvers
This paper shows how the symmetric eigenproblem, which is the computationally most demanding part of numerous scientiic and industrial applications, can be solved much more eeciently than by using algorithms currently implemented in Lapack routines. The main techniques used in the algorithm presented in this paper are (i) sophisticated blocking in the tridiagonalization, which leads to a two-sw...
متن کاملNumerical Experiments with Symmetric Eigensolvers
This report describes and analyzes numerical experiments carried out with various symmetric eigensolvers in the context of the material science code Wien 97. Of particular interest are the performance improvements achieved with a new Level 3 eigensolver. The techniques which lead to a signi cant speed up are (1) sophisticated blocking in the tridiagonalization step, which leads to a twosweep al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 90 شماره
صفحات -
تاریخ انتشار 2014